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ABSTRACT 
Social media platforms offer avenues for fostering anonymous online connections, discussions on diverse topics like 
culture, politics, and community life. However, the proliferation of hate speech poses a pressing challenge for 
society, individuals, policymakers, and researchers alike, both on the continent and globally. Addressing this issue 
necessitates comprehensive studies to identify and combat hate speech effectively. This paper conducts a systematic 
review of literature in this domain, concentrating on methodologies such as word embedding, machine learning, 
deep learning, and cutting-edge technologies. Specifically focusing on the past six years of research, this review 
highlights gaps, challenges, and advancements in hate speech detection techniques. Additionally, it delves into 
limitations, algorithmic selection dilemmas, data collection complexities, cleaning challenges, and outlines future 
research pathways in this critical area. 
Keywords: Hate Speech Detection, Machine Learning, Social Media Platforms, Text Analysis, Algorithm 
Selection. 


INTRODUCTION 
The rapid adoption of social media has revolutionized communication, leading to extensive information 
dissemination. In Ethiopia, social media serves as a primary information source, but its unverified content poses 
challenges [1]. Notably, during the Tigray conflict, social media played a role in civil disruptions, road closures, 
displacements, and loss of lives [2]. Instances of inciting violence against ethnic minorities circulated online [3]. 
The conflict also led to media shutdowns and arrests of journalists [4]. Globally, hate speech and fake news have 
raised concerns, prompting some countries to balance free speech with preventing its dangerous evolution [5]. 
UNESCO outlines strategies like media ethics education, conflict-sensitive reporting, social media oversight, 
reporting mechanisms, and fighting impunity for hate crimes [6]. This scientific investigation explores hate 
speech's contemporary landscape and Ethiopian language-specific challenges. Addressing hate speech is critical 
globally, impacting lives and historical narratives [7]. Public awareness and concerted efforts are essential for 
citizens' safety [8]. Ethiopia faced turmoil due to misinformation on social media, affecting education and trade 
[9]. A scientific solution derived from comprehensive research is crucial. This survey identifies current challenges, 
integrates existing knowledge, and guides future research. 
Hate Speech 

Hate speech encompasses any communication—verbal, written, or through actions—that employs offensive or 
discriminatory language targeting individuals or groups based on attributes like religion, ethnicity, race, gender, or 
other defining factors [10, 11]. It disrupts community harmony and perpetuates discrimination. In response to 
rising incidents of hate speech, the UN introduced the UN Strategy and Plan of Action on Hate Speech in 2019, 
aiming to address root causes and societal impacts through thirteen commitments, including victim support, social 
media awareness, and education [12]. Social media's pivotal role in daily communication, information 
dissemination, and entertainment amplifies hate speech's impact. Studies exist to mitigate this issue, and major 
platforms like Facebook, Twitter, and YouTube employ models to distinguish hate speech from genuine content 
[13]. However, diverse languages and interpretations pose ongoing challenges in hate speech identification, which 
often targets multiple aspects of identity, including gender, religion, race, and disability [14]. 
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Gender-based Hate Speech 
Expressions disseminating, endorsing, or provoking hatred based on gender fall under gender-based hate speech, 
primarily targeting women and girls [15]. This form, known as sexist hate speech, condemns and undermines 
women, fostering fear and mistrust toward them in society. Technological advancements and social media 
accessibility have escalated violence against women and girls [16]. Online violence against women, primarily 
facilitated by social networks, significantly impacts their personal lives and professional careers [16]. Research 
indicates that this harassment could drive women to join extremist groups [17]. Additionally, [18] identified high 
instances of cyber harassment against women on social media platforms. This research paper delves into the 
pervasive issue of online bullying, particularly affecting young adults who experience severe forms of harassment. 
Recognizing the urgency, the paper aims to fill research gaps and propose solutions to combat this escalating 
problem. 
Religious Hate Speech 
Hatred directed at religions such as Islam, Hinduism, and Christianity constitutes a severe form of hate speech, 
impacting groups rather than individuals [19]. Extremist individuals within these religions face negative 
stereotypes, discrimination, and online abuse, particularly the rising trend of anti-Muslim abuse [20]. The 
internet's role as an amplifier intensifies existing discourses, fostering polarization [20]. Social media platforms 
serve as a breeding ground for illegal activities, fueling misunderstandings, intolerance, and heated religious 
debates, leading to tensions among followers [21]. This phenomenon spans Europe, Asia, and Africa [217]. Hate 
speech online exacerbates religious tensions, fueling extreme animosity [22]. History has shown the dangers 
when religion is manipulated for political motives, especially in Ethiopia's diverse religious landscape, where 
conflicts can have extensive consequences. Rigorous scientific research is imperative to prevent such incidents from 
recurring. 
Racist Hate Speech 
Racist hate speech targets a person or group based on appearance and fosters the belief in the inferiority of 
certain racial groups [23]. Such expressions often occur at the international level, influenced by governmental 
attitudes and leadership, varying in frequency and impact. The concerning rise of individuals propagating racial 
hatred on social media signifies a growing population that rejects equality [24]. Identifying these individuals 
requires a scientific approach to address moral degradation and promote equality. This compilation of studies 
offers crucial insights to tackle this urgent issue. 
Hate speech on disability 
Disability hate speech seeks to dehumanize individuals with physical or mental disabilities, viewing disability as a 
societal construct akin to race or gender [25]. Disabilities can arise from various conditions, including medical 
errors, accidents, or natural causes. Unfortunately, online social media users perpetuate hate speech that harms 
individuals with disabilities, impacting their living conditions and inclusion within communities [26]. This 
hinders their interaction and creates discord. Recognizing equality as a fundamental human value is vital, 
necessitating educational efforts to promote self-improvement, unity, and the disregard of disabilities. 
Stages of Hate Speech 
According to [27], hate speech follows a four-stage process. Initially, the "influence stage" emerges, characterized 
by heightened social media activity after an event, intensifying the spread of hate speech. This is succeeded by the 
"intervention stage," where the event's impact diminishes over several days. Subsequently, a gradual decline leads 
to the "response stage," where the impact reaches zero. The final "rebirth stage," represented by a dashed line in 
the illustration, is an optional phase where hate speech may resurface or not based on subsequent occurrences' 
nature and impact [27]. 
Hate Speech Techniques 
The detection of hate speech has been predominantly approached through three primary methodologies: 
Keyword-Based Techniques, Machine Learning Techniques, and Deep Learning Techniques, often used 
individually or in hybrid forms. Researchers have extensively investigated the prevalence and usage patterns of 
these methodologies in hate speech detection. The overall composition is given below: 
Keyword-Based Technique 
As discussed in [28], the keyword-based method is a foundational approach in identifying hate speech. This 
technique involves scanning text for potentially offensive keywords through a predefined dictionary, often collected 
from diverse social media platforms like Facebook, Twitter, blogs, forums, and YouTube. However, this approach 
has limitations. While these keywords are associated with repugnant actions, their context and meaning can evolve 
over time, rendering them insufficient for comprehensive hate speech detection. Content devoid of explicit slurs 
might not be flagged, posing a challenge to this method. For instance, the phrase "umu anumanu e bilie" might 
have a benign literal meaning but could hold different implications in various contexts, evolving over time based 
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on interpretation. Additionally, as highlighted in [29], keyword-based techniques struggle to detect hate speech 
conveyed through metaphors or slang lacking explicit hate keywords. For instance, the slang expression "umu inya 
amaghi egwuregwu," literally translating to "donkeys are not careful when playing," may hold a different, 
potentially contentious meaning in a political or religious context. 
Machine Learning Technique 
Described in [30], machine learning is a scientific methodology harnessed by computer systems that leverage 
algorithms and statistical models to efficiently perform tasks. Unlike explicit programming, it relies on patterns 
and data to operate. As a subset of artificial intelligence, machine learning algorithms develop a mathematical 
model based on training data, allowing them to make predictions or decisions without specific task-oriented 
programming. The primary goal is to create a classifier or regression model by learning from a training dataset, 
followed by evaluating its performance using test data. Machine learning encompasses supervised, unsupervised, 
and semi-supervised learning methodologies. 
Deep Learning Technique 
Described in [31], deep learning is a machine learning technique that emulates human learning by using 
examples. This approach employs neural networks to tackle complex problems. Through deep learning, computer 
models can directly perform classification tasks from inputs such as images, text, or sound, achieving notably high 
levels of accuracy. Training these models requires extensive labeled data and intricate, multi-layered neural 
network structures [32]. 
Hybrid Technique 
This method is used to overcome the limitations inherent in a single approach. By merging two or more 
approaches into a hybrid method, leveraging their strengths to complement each other, it presents a promising 
solution. 
Keyword-Based Technique 
The technique involves the collection and categorization of keywords within a specific context, assessing their 
frequency within a document to understand its content. Illustrated below: In their study as detailed in [33], an 
analysis was conducted on the Ethiopic Twitter Dataset for Abusive Speech in Amharic. The study aimed to gather 
data for training linguistic models in language identification tasks and to examine the prevalence of specific 
keywords linked to abusive language. The textual data encompassed the Amharic, Tigrinya, and Ge’ez languages, 
totaling approximately three million tweets from 154,477 users between mid-August 2014 and 2019. Native 
speakers compiled 99 hate speech and 48 offensive speech keywords for the Amharic language, sourced from 
Facebook, Twitter, and YouTube comments. Excluding data from 2015 due to encoding issues, the assembled 14:7 
keywords were categorized into hate speech and offensive speech. The analysis revealed an increasing trend in both 
the number of Amharic tweets and the prevalence of tweets containing offensive keywords over time. 
Machine Learning Techniques 
The study outlined in [34] centered on assessing data quality, focusing on hate speech within online discourse. It 
commenced by utilizing a hate speech lexicon from Hatebase.org, compiling words and phrases identified as hate 
speech by Internet users. Employing the Twitter API, the researchers searched for tweets containing lexicon 
terms, amassing a sample from 33,458 Twitter users, resulting in about 84.4 million tweets. Of these, 25,000 
tweets containing lexicon terms were manually labeled by Crowd Flower workers. Workers were tasked with 
categorizing each tweet as hate speech, offensive yet not hateful, or neither offensive nor hate speech, considering 
the context in which the words were used. Using an inter-coder agreement approach involving three or more 
workers, 24,802 samples were considered, as most tweets lacked consensus among coders. From these, 5% were 
categorized as hate speech by majority consensus, with 1.3% deemed as such without opposition from coders. As 
the study employed stringent criteria for hate speech classification, most tweets were labeled as offensive language, 
with a minority classified as non-offensive. The study employed the Porter stemming algorithm and TF-IDF for 
analysis, evaluating tweet quality through modified Flesch-Kincaid Grade Level and Flesch Reading Ease scores. 
Five classical algorithms—logistic regression, naive Bayes, decision trees, random forests, and linear SVMs—were 
employed, with Grid Search used to optimize parameters. To prevent overfitting, 5-fold cross-validation was 
employed, highlighting logistic regression and linear SVM as the top-performing models. Ultimately, logistic 
regression was chosen, exhibiting an overall precision of 0.91, recall of 0.90, and an F1 score of 0.90. This study's 
focus on data quality aimed to comprehend the context in which offensive language and hate speech impact race, 
religion, and societal identity. 
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Deep Learning Techniques 
The work presented in [35] aimed to create an automated Amharic Hate Speech Posts and Comments Detection 
Model using Recurrent Neural Networks (RNNs). It began with a literature review covering various approaches to 
online hate speech, including legal perspectives addressing hate speech on social media across different continents 
and countries. A dataset of 30,000 manually collected posts and comments from prominent activists and news 
pages was annotated into hate and free speech categories. The data underwent pre-processing through cleaning 
and normalization techniques. An RNN was developed utilizing LSTM and Gated Recurrent Unit (GRU), 
leveraging word n-grams for feature extraction and word2vec for word representation. The LSTM and GRU 
models were trained and tested on the dataset, split into training, validation, and test sets in an 80:10:10 ratio. 
Experimentation with different parameters on the GRU and LSTM-based RNN models using word2vec feature 
representation yielded a superior test accuracy of 97.9% achieved by RNN-LSTM. In [36], a study on Bangla 
hates speech detection on social media employed an attention-based recurrent neural network. Gathering 7,425 
comments from Facebook, the data was divided into 80% for training and 20% for testing. Encoder—decoder-based 
machine learning models, including attention mechanism, LSTM, and GRU-based decoders, were utilized to 
predict hate speech categories. Among these algorithms, the attention-based decoder attained the highest accuracy, 
reaching 77%. These studies utilized different recurrent neural network architectures for hate speech detection in 
Amharic and Bangla languages, achieving high accuracies through their respective approaches. 
Hybrid Techniques 
The study detailed in [37] focused on detecting and classifying Afaan Oromo hate speech on social media. 
Collecting 12,812 data instances from Afaan Oromo language-based Facebook accounts, various machine learning 
algorithms were applied, including classical (SVM and NB), ensemble (RF and XGBoost), and deep learning (CNN 
and BiLSTM), utilizing different feature extraction techniques such as BOW, TF-IDF, Word2vec, and embedding 
layers. Two experiments were conducted for eight and two-class classifications. SVM with Word2vec achieved 82% 
accuracy for eight-class classification among classical and ensemble machine learning algorithms, while BiLSTM 
with pretrained Word2vec obtained 84% accuracy for the same classification. Additionally, BiLSTM with 
pretrained Word2vec attained a performance result of 0.88% accuracy. Moreover, the authors suggested further 
research directions in hate speech detection for audio, video, emoji, and memes, particularly in multilingual 
contexts. In the same vein, the study analyzed hate speech identification in Hinglish language using transformer 
models mBERT and IndicBERT for feature selection, focusing on code-mixed tweets. They tested traditional 
machine learning classifiers (SVM, LR, RF, NB, KNN) on translated and transliterated Devanagari script using 
mBERT embeddings. Subsequently, they experimented with the Deep Neural Network (DNN) model. Their 
results showed that their model outperformed existing methods, achieving a 73% accuracy for hate speech 
identification in Hinglish. The mBERT model and traditional machine learning classifiers exhibited better 
performance compared to IndicBERT for hate content detection in Hindi and English datasets. Both studies 
demonstrated the effectiveness of various machine learning techniques, including hybrid approaches, in detecting 
hate speech across different languages and social media contexts [37]. 
Comparative Analysis 
The paper employs diverse methods and languages to detect hate speech content. It primarily utilizes classical 
machine learning algorithms and integrates them with deep learning methods either individually or in conjunction 
with other machine learning techniques. The amalgamation of deep learning with other algorithms enables the 
creation of models’ adept at effectively identifying hate speech. Despite the substantial data needs of deep learning 
algorithms, their showcased accuracy in the reviewed studies establishes them as a favoured option for hate speech 
detection. 
Findings 
Each language harbors its unique nuances, making a machine learning algorithm effective in one language 
potentially ineffective in another. However, by meticulously observing linguistic intricacies and conducting 
thorough research, gathering and organizing data for such studies is achievable. This survey primarily delves into 
reviewing research related to the identification and anticipation of hate speech in the Amharic language. It 
accentuates the imperative for more extensive research on Ethiopian languages. The survey illuminates significant 
gaps within the field, notably the absence of benchmarks and various natural language processing tools tailored to 
these languages. Furthermore, it highlights a deficiency in standardized data annotation practices, often leading to 
research outputs that may appear hurried and lacking depth. Despite the volume of studies conducted, a palpable 
dearth persists in the body of knowledge concerning Amharic and other Ethiopian languages. 
CONCLUSION 
This paper conducts a comprehensive survey focusing on the automated detection of hate speech. It explores 
diverse aspects such as terminology, hate speech stages, detection methods, features like word2vec and embeddings, 
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and model performance. The study scrutinizes various research methodologies, emphasizing their data 
preprocessing techniques and highlighting encountered challenges. Notably, the surveyed studies often neglect 
figurative language and visual content, despite their significant role in conveying hate messages. An important 
challenge identified pertains to data collection, primarily aggregating and evaluating data at the individual level, 
potentially overlooking hate speech manifested through bullying or offenses against ethnic minorities. To address 
these issues, the paper recommends the creation of a benchmark dataset for hate speech identification, aiming to 
facilitate better comparison across different features and methodologies. 
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